[PR1] refactoring :apps build and adding spark3.5/iceberg1.5 artifact #255
Is this ready for review?
@autumnust waiting on #221 to merge so I can rebase. This PR is forked from that one. If you want, you can compare the commit delta here: https://github.com/linkedin/openhouse/pull/255/files/770a6d0497b9af7b1bedb876cbf639b71b3a9822..94c5100d5aa780a373ee5d0e14394ecdcec2f561
Thanks for adding the new module! One major request is to use iceberg-convention-1.2.gradle and iceberg-convention-1.5.2.gradle to avoid duplicate code.
buildSrc/src/main/groovy/openhouse.apps-spark-conventions.gradle
Thanks for addressing the comments!
Summary
This is Part 1 of 2 for enabling compaction-based MoR.
problem: Enabling Merge-on-Read (MoR) for customer tables requires the ability to compact delete files. Compacting delete files is supported in spark3.1/iceberg1.2, but we need to test it before relying on it. We cannot test it there, because spark3.1 does not support creating delete files (only compacting existing ones).
solution: We add a spark3.5/iceberg1.5 compaction application, which extends the existing compaction test suite. In a follow-up PR, we will add tests that create delete files and compact them under the latest versions.
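For readers new to the terminology, the following toy model sketches what "delete files" and "compacting delete files" mean under Merge-on-Read. It is only an illustration: every name in it is hypothetical, and it does not use or reproduce the Iceberg, Spark, or OpenHouse APIs. It assumes position-style deletes, where each delete entry is a (data file, row position) pair that is applied at read time.

```python
from collections import defaultdict

# Toy model only: all names here are hypothetical, not Iceberg/OpenHouse APIs.
# A data file is a list of rows; a position delete file is a list of
# (data_file_name, row_position) pairs written by a row-level delete.


def read_merge_on_read(data_files, delete_files):
    """Return live rows by applying every position delete at read time."""
    deleted = defaultdict(set)
    for delete_file in delete_files:
        for data_file_name, pos in delete_file:
            deleted[data_file_name].add(pos)
    return [
        row
        for name, rows in sorted(data_files.items())
        for pos, row in enumerate(rows)
        if pos not in deleted[name]
    ]


def compact_delete_files(delete_files):
    """Merge many small delete files into one sorted, de-duplicated file."""
    merged = sorted({entry for delete_file in delete_files for entry in delete_file})
    return [merged] if merged else []


data_files = {"a.parquet": ["r0", "r1", "r2"], "b.parquet": ["r3", "r4"]}
delete_files = [
    [("a.parquet", 1)],
    [("b.parquet", 0)],
    [("a.parquet", 1), ("a.parquet", 2)],
]

before = read_merge_on_read(data_files, delete_files)
compacted = compact_delete_files(delete_files)
after = read_merge_on_read(data_files, compacted)
```

In this sketch the property worth checking is that `after` equals `before` while `compacted` holds fewer delete files than `delete_files`. A test that creates delete files and then compacts them would presumably assert the same kind of invariant against real tables.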
Changes
Testing Done
all tests running on spark3.1 and spark3.5:
correct spark version resolutions for test runtime:
correct spark version resolutions for runtime:
Additional Information